On Long-range-dependence Synthetic Data Generation through Heavy-tailed Distributions
نویسنده
چکیده
Comprehensive analysis of the Internet traffic conducted over the last decade revealed the concept of longrange dependence (LRD) in the collected data traces. Some of this analysis work proved that this property is a result of the heavy tailed distribution of the packet inter-arrival time, which is a result of the heavy tailed distribution of packet size of the traffic sources. In this paper, we use this fact to build a traffic generator model that is based on multiplexing sources of heavy-tailed inter-arrival times and then counting the number of arrivals in the successive periods of length T. To come up with a parameterized model, we also study the effect of the heavy tailed distribution parameters (i.e. the Pareto distribution), and the time scale along with the number of multiplexed sources on the LRD of the generated traffic. Our model is different from the on/off models proposed in the literature that need to multiplex a very large number of sources, while we need only a few sources. Our findings are; 1. The LRD level is higher for sources with heavier tails; 2. There is a need to multiplex only a few sources; 3. The LRD level is higher for larger time scales; 4. Fixing the time scale and tuning the tail of the distribution helps us to come up with a parameterized linear model that only has one parameter, which is the Hurst parameter.
منابع مشابه
Long Range Dependence in Heavy Tailed Stochastic Processes
The notion of long range dependence has traditionally been deened through a slow decay of correlations. This approach may be completely inappropriate in the case of a stochastic process with heavy tails. Yet long memory has been reported to be found in various elds where heavy tails are a standard feature of the commonly used stochastic models. Financial and communications networks data are amo...
متن کاملAccuracy and Computational Efficiency on the Fractal Traffic Generation
The use of synthetic self-similar traffic in computer networks simulation is of vital importance for the capturing and reproducing of actual Internet data traffic behavior. A commonly used technique for generating selfsimilar traffic is achieved by aggregating On/Off sources where the active (On) and idle (Off) periods exhibit heavy tailed distributions. This work analyzes the balance between a...
متن کاملCharacterization of multivariate heavy-tailed distribution families via copula
The multivariate regular variation (MRV) is one of the most important tools in modeling multivariate heavy-tailed phenomena. This paper characterizes the MRV distributions through the tail dependence function of the copula associated with them. Along with some existing results, our studies indicate that the existence of the lower tail dependence function of the survival copula is necessary and ...
متن کاملA Single Channel on - off Communication Model
1.1 INTRODUCTION There now exist several large teletraac data sets exhibiting non-standard features incompatible with classical assumtions of short range dependence and rapidly decreasing tails. For instance, it is worth exploring the variety of data catalogued at the ITA web site www.acm.org/sigcomm/ITA/. These data sets exhibit the phenomena of heavy-tailed marginal distributions and long ran...
متن کاملTCP Congestion Control and Heavy Tails
Long-range dependence has been observed in many recent Internet traffic measurements. Previous studies have shown that there is a close relationship between heavy-tailed distribution of various traffic parameters and the long-range dependent property. In this paper, we use a simple Markov chain model to argue that when the loss rate is relatively high, TCP’s adaptive congestion control mechanis...
متن کامل